Exploring SIMD for molecular dynamics, using Intel Xeon processors and Intel Xeon Phi coprocessors

Abstract

We analyse gather-scatter performance bottlenecks in molecular dynamics codes and the challenges that they pose for obtaining benefits from SIMD execution. This analysis informs a number of novel code-level and algorithmic improvements to Sandia's miniMD benchmark, which we demonstrate using three SIMD widths (128-, 256- and 512-bit). The applicability of these optimisations to wider SIMD is discussed, and we show that the conventional approach of exposing more parallelism through redundant computation is not necessarily best.

In single precision, our optimised implementation is up to 5x faster than the original scalar code running on Intel Xeon processors with 256-bit SIMD, and adding a single Intel Xeon Phi coprocessor provides up to an additional 2x performance increase. These results demonstrate: (i) the importance of effective SIMD utilisation for molecular dynamics codes on current and future hardware; and (ii) the considerable performance increase afforded by the use of Intel Xeon Phi coprocessors for highly parallel workloads.
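The gather-scatter pattern mentioned in the abstract can be illustrated with a small scalar sketch. This is not miniMD code and not the authors' method. It assumes a Lennard-Jones force in reduced units, a brute-force neighbour search, and hypothetical function names (`lanes`, `build_neighbour_lists`, `lj_forces_half`, `lj_forces_full`). It also assumes that "redundant computation" refers to a full neighbour list, in which every pair is evaluated twice so that no force is scattered to a neighbour; the abstract does not confirm this.

```python
# Illustrative sketch only; all names are hypothetical, not taken from miniMD.

def lanes(simd_bits, element_bits=32):
    """Elements per SIMD register (32-bit elements = single precision)."""
    return simd_bits // element_bits


def build_neighbour_lists(pos, cutoff_sq):
    """Brute-force O(n^2) search returning (half, full) neighbour lists."""
    n = len(pos)
    half = [[] for _ in range(n)]  # each pair stored once, under the lower index
    full = [[] for _ in range(n)]  # each pair stored twice, once per atom
    for i in range(n):
        for j in range(i + 1, n):
            r2 = sum((pos[i][k] - pos[j][k]) ** 2 for k in range(3))
            if r2 < cutoff_sq:
                half[i].append(j)
                full[i].append(j)
                full[j].append(i)
    return half, full


def _pair_scale(d, cutoff_sq):
    """Lennard-Jones force divided by distance (epsilon = sigma = 1)."""
    r2 = d[0] * d[0] + d[1] * d[1] + d[2] * d[2]
    if r2 >= cutoff_sq:
        return 0.0
    inv_r2 = 1.0 / r2
    inv_r6 = inv_r2 ** 3
    return 48.0 * inv_r6 * (inv_r6 - 0.5) * inv_r2


def lj_forces_half(pos, half, cutoff_sq):
    """Half list: gather pos[j], then scatter the opposite force to atom j."""
    forces = [[0.0, 0.0, 0.0] for _ in pos]
    for i, neighbours in enumerate(half):
        for j in neighbours:
            d = [pos[i][k] - pos[j][k] for k in range(3)]  # indexed gather
            scale = _pair_scale(d, cutoff_sq)
            for k in range(3):
                forces[i][k] += scale * d[k]
                forces[j][k] -= scale * d[k]  # indexed scatter (write conflict risk)
    return forces


def lj_forces_full(pos, full, cutoff_sq):
    """Full list: gather only; each pair is computed twice, nothing is scattered."""
    forces = [[0.0, 0.0, 0.0] for _ in pos]
    for i, neighbours in enumerate(full):
        for j in neighbours:
            d = [pos[i][k] - pos[j][k] for k in range(3)]  # indexed gather
            scale = _pair_scale(d, cutoff_sq)
            for k in range(3):
                forces[i][k] += scale * d[k]
    return forces
```

Both variants produce the same forces up to floating-point rounding. The half-list version does half the arithmetic but writes to `forces[j]` through an index, which is the access that is awkward to vectorise; the full-list version avoids that write at the cost of doubling the pair evaluations. `lanes(128)`, `lanes(256)` and `lanes(512)` give the number of single-precision elements per register for the three SIMD widths named in the abstract.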

Bibliographic details

Authors: Pennycook, Simon J.; Hughes, C. J.; Smelyanskiy, M.; Jarvis, Stephen A.
Year: 2013
Original format: PDF
Language: English

Similar literature

1. Effective SIMD Vectorization for Intel Xeon Phi Coprocessors [J]. Xinmin Tian, Hideki Saito, Serguei V. Preis. Scientific Programming, 2015, no. 4.
2. Effective SIMD Vectorization for Intel Xeon Phi Coprocessors [J]. Tian Xinmin, Saito Hideki, Preis Serguei V. Scientific Programming, 2015.
3. Beacon: Exploring the Deployment and Application of Intel Xeon Phi Coprocessors for Scientific Computing [J]. Brook R. Glenn, Heinecke Alexander, Costa Anthony B. Computing in Science & Engineering, 2015, no. 2.
4. Exploring SIMD for Molecular Dynamics, Using Intel® Xeon® Processors and Intel® Xeon Phi Coprocessors [C]. IEEE International Parallel Distributed Processing Symposium, 2013.
5. Advancing LAMMPS Performance on Intel Xeon Phi Processors Coprocessors [D]. Vorsu, Sandeep Kumar. 2017.
6. Efficient irregular wavefront propagation algorithms on Intel® Xeon Phi™ [O]. Jeremias M. Gomes, George Teodoro, Alba de Melo.
7. Explicit Fourth-Order Runge–Kutta Method on Intel Xeon Phi Coprocessor [O]. Beata Bylina, Joanna Potiopa. 2016.
